SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: HIGHFIELD, PETER EDMUND 
RODGERS, BRIAN COLIN 
TEDDER, RICHARD SETON 
BARBARA, JOHN ANTHONY JAMES 

<ii) TITLE OF INVENTION: VIRAL AGENT 

(iii) NUMBER OF SEQUENCES: 25 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 NORTH GLEBE ROAD 

(C) CITY: ARLINGTON 

(D) STATE: VIRGINIA 

(E) COUNTRY: U.S.A. 

(F) ZIP : 22201 
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(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy diskette, 5.25 inch, 360 Kb storage 

(B) COMPUTER: IBM AT Compatible 

(C) OPERATING SYSTEM: MS-DOS V3 . 2 

(D) SOFTWARE: WordPerfect 5.0 {DOS text) 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/664,363 

(B) FILING DATE: 18 SEP 2000 

( C ) CLASSIFICATION : 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/191,160 

(B) FILING DATE: 03 FEB 1994 

(C) CLASSIFICATION: 435/235 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/628,516 

(B) FILING DATE: 17 DEC 1990 

(C) CLASSIFICATION: 435/235 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: UK 89 28 562.1 

(B) FILING DATE: 18 DEC 1989 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: UK 90 04 414.0 

(B) FILING DATE: 27 FEB 1990 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: UK 90 04 814.1 

(B) FILING DATE: 03 MAR 1990 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: MARY J. WILSON 

(B) REGISTRATION NUMBER: 32,955 

(C) REFERENCE/DOCKET NUMBER: 2035-38 



(xi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 816-4011 

(B) TELEFAX: (703) 816-4100 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: bacteriophage lambda gtll 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Oligonucleotide synthesizer; oligo dl9 

(ix) FEATURE: 

(B) LOCATION: from 1 to 21 bases homologous to upstream portion 
of lacZ gene flanking the EcoRl site in bacteriophage 
lambda gtll 

(D) OTHER INFORMATION: primes DNA synthesis from the phage 
vector into cDNA inserted at the EcoRl site. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGTGGCGACG ACTCCTGGAG C 21 




(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda gtll 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Oligonucleotide synthesizer; oligo d20 

(ix) FEATURE: 

(B) LOCATION: from 1 to 21 bases homologous to downstream 
portion of lacZ gene flanking the EcoRl site in 
bacteriophage lambda gtll 

(D) OTHER INFORMATION: primes DNA synthesis from the phage 
vector into cDNA inserted at the EcoRl 
site. 



It ' 

" (xi) SEQUENCE DESCRIPTION: SEQ ID N0:2i 
TTGACACCAG ACCAACTGGT A 21 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1770 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to genomic RNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: clone JG2 from cDNA library in lambda gtll 

( ix ) FEATURE : 

(B) LOCATION: from 1 to 1770 bp portion of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION: probably encodes viral non-structural 
proteins 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CAA AAT GAC TTC CCA GAC GCT GAC CTC ATC GAG GCC AAC CTC CTG TGG 48 
Gin Asn Asp Phe Pro Asp Ala Asp Leu lie Glu Ala Asn Leu Leu Trp 

5 10 15 

CGG CAT GAG ATG GGC GGG GAC ATT ACC CGC GTG GAG TCA GAG AAC AAG 96 
Arg His Glu Met Gly Gly Asp lie Thr Arg Val Glu Ser Glu Asn Lys 
20 25 30 

GTA GTA ATC CTG GAC TCT TTC GAC CCG CTC CGA GCG GAG GAG GAT GAG 144 
Val Val lie Leu Asp Ser Phe Asp Pro Leu Arg Ala Glu Glu Asp Glu 
35 40 45 

CGG GAA GTG TCC GTC CCG GCG GAG ATC CTG CGG AAA TCC AAG AAA TTC 192 
Arg Glu Val Ser Val Pro Ala Glu lie Leu Arg Lys Ser Lys Lys Phe 
50 55 60 

CCA CCA GCG ATG CCC GCA TGG GCA CGC CCG GAT TAC AAC CCT CCG CTG 240 
Pro Pro Ala Met Pro Ala Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu 
65 , 70 75 80 

CTG GAG TCC TGG AAG GCC CCG GAC TAC GTC CCT CCA GTG GTA CAT GGG 288 
Leu Glu Ser Trp Lys Ala Pro Asp Tyr Val Pro Pro Val Val His Gly 

85 90 95 



TGC CCA CTG CCA CCT ACT A^ ACC CCT CCT ATA CCA CCT (91 CGG AGA 336 

Cys Pro Leu Pro" Pro 'Thr Lys Thr Pro Pro lie Pro Pro Pro Arg Arg 
100 105 110 

AAG AGG ACA GTT GTT CTG ACA GAA TCC ACC GTG TCT TCT GCC CTG GCG 384 
Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Ala Leu Ala 
115 120 125 

GAG CTT GCC ACA AAG GCT TTT GGT AGC TCC GGA CCG TCG GCC GTC GAC 432 
Glu Leu Ala Thr Lys Ala Phe Gly Ser Ser Gly Pro Ser Ala Val Asp 
130 135 140 

AGC GGC ACG GCA ACC GCC CCT CCT GAC CAA TCC TCC GAC GAC GGC GGA 480 
Ser Gly Thr Ala Thr Ala Pro Pro Asp Gin Ser Ser Asp Asp Gly Gly 
145 150 155 160 

GCA GGA TCT GAC GTT GAG TCG TAT TCC TCC ATG CCC CCC CTT GAG GGG 528 
Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly 

165 170 175 

GAG CCG GGG GAC CCC GAT CTC AGC GAC GGG TCT TGG TCT ACC GTG AGT 576 
Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser 
180 185 190 

GAG GAG GCC GGT GAG GAC GTC GTC TGC TGC TCG ATG TCC TAC ACA TGG 624 
Glu Glu Ala Gly Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp 
195 200 205 

ACA GGC GCT CTG ATC ACG CCA TGC GCT GCG GAG GAA AGC AAG CTG CCC 672 
Thr Gly Ala Leu He Thr Pro Cys Ala Ala Glu Glu Ser Lys Leu Pro 
210 215 220 

ATC AAC GCG TTG AGC AAC TCT TTG CTG CGT CAC CAC AAC ATG GTC TAC 720 
He Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Met Val Tyr 
225 230 235 240 

GCT ACC ACA TCC CGC AGC GCA AGC CAG CGG CAG AAG AAG GTC ACC TTT 768 
Ala Thr Thr Ser Arg Ser Ala Ser Gin Arg Gin Lys Lys Val Thr Phe 

245 250 255 

GAC AGA CTG CAA ATC CTG GAC GAT CAC TAC CAG GAC GTG CTC AAG GAG 816 
Asp Arg Leu Gin He Leu Asp Asp His Tyr Gin Asp Val Leu Lys Glu 
260 265 270 

ATG AAG GCG AAG GCG TCC ACA GTT AAG GCT AAG CTT CTA TCA GTA GAG 864 
Met Lys Ala Lys Ala Ser Thr Val Lys Ala Lys Leu Leu Ser Val Glu 
275 280 285 

GAA GCC TGC AAG CTG ACG CCC CCA CAT TCG GCC AAA TCT AAA TTT GGC 912 
Glu Ala Cys Lys Leu Thr Pro Pro His Ser Ala Lys Ser Lys Phe Gly 
290 295 300 



TAT GGG GCA AAG GAC GTC CG^AAC CTA TCC AGC AAG GCC A^ AAC CAC 960 
Tyr Gly Ala Lys' Asp Val Arg Asn Leu Ser Ser Lys Ala lie Asn His 
305 310 315 320 

ATC CGC TCC GTG TGG GAG GAC TTG TTG GAA GAC ACT GAA ACA CCA ATT 1008 
He Arg Ser Val Trp Glu Asp Leu Leu Glu Asp Thr Glu Thr Pro He 

325 330 335 

GAC ACC ACC ATC ATG GCA AAA AAT GAG GTT TTC TGC GTC CAA CCA GAG 1056 
Asp Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu 
340 345 350 

AGA GGA GGC CGC AAG CCA GCT CGC CTT ATC GTG TTC CCA GAC TTG GGG 1104 
Arg Gly Gly Arg Lys Pro Ala Arg Leu He Val Phe Pro Asp Leu Gly 
355 360 365 

GTC CGT GTG TGC GAG AAA ATG GCC CTC TAT GAC GTG GTC TCC ACC CTC 1152 
Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Ser Thr Leu 
370 375 380 

CCT CAG GCT GTG ATG GGC TCC TCG TAC GGA TTC CAG TAT TCT CCT GGA 1200 
Pro Gin Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly 
385 390 395 400 

CAG CGG GTC GAG TTC CTG GTG AAC GCC TGG AAA TCA AAG AAG ACC CCT 1248 
Gin Arg Val Glu Phe Leu Val Asn Ala Trp Lys Ser Lys Lys Thr Pro 

405 410 . 415 

ATG GGC TTT GCA TAT GAC ACC CGC TGT TTT GAC TCA ACA GTC ACT GAG 1296 
Met Gly Phe Ala Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 
420 425 430 

AAT GAC ATC CGT GTA GAG GAG TCA ATT TAT CAA TGT TGT GAC TTG GCC ,1344 
Asn Asp He Arg Val Glu Glu Ser He Tyr Gin Cys Cys Asp Leu Ala 
435 440 445 

CCC GAA GCC AGA CAG GCC ATA AGG TCG CTC ACA GAG CGG CTT TAT ATC 1392 
Pro Glu Ala Arg Gin Ala He Arg Ser Leu Thr Glu Arg Leu Tyr He 
450 455 460 

GGG GGT CCC CTG ACT AAT TCA AAA GGG CAG AAC TGC GGC TAT CGC CGG 1440 
Gly Gly Pro Leu Thr Asn Ser Lys Gly Gin Asn Cys Gly Tyr Arg Arg 
465 470 475 480 

TGC CGC GCG AGC GGC GTG CTG ACG ACT AGC TGC GGT AAT ACC CTC ACA 1488 
Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr 

485 490 495 

TGT TAC TTG AAG GCC TCT GCA GCC TGT CGA GCT GCA AAG CTC CAG GAC 1536 
Cys Tyr Leu Lys Ala Ser Ala Ala Cys Arg Ala Ala Lys Leu Gin Asp 
500 505 510 



TGC ACG ATG CTC GTG TGC G^ GAC GGC CTT GTC GTT ATC GAG AGC 1584 

Cys Thr Met Leu* Val Cys Gly Asp Asp Leu Val Val lie Cys Glu Ser 
515 520 525 

GCG GGA ACC CAG GAG GAC GCG GCG AGC CTA CGA GTC TTC ACG GAG GCT 1632 
Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu Arg Val Phe Thr Glu Ala 
530 535 540 

ATG ACT AGG TAC TCT GCC CCC CCC GGG GAC CCG CCC CAA CCA GAA TAC 1680 
Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr 
545 550 555 560 

GAC CTG GAG TTG ATA ACA TCA TGC TCC TCC AAT GTG TCG GTC GCG CAC 1728 
Asp Leu Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala His 

565 570 575 

GAT GCA TCT GGC AAA AGG GTA TAC TAC CTC ACC CGT GAC CCG 1770 
Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro 
580 585 590 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to genomic RNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human? serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: clone JG3 from cDNA library in lambda gtll 

( ix ) FEATURE : 

(B) LOCATION: from 1 to 1035 bp portion of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION: probably encodes viral non-structural 
proteins 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ACA GAA GTG GAT GGG GTG CGG CTG CAC AGG TAC GCT CCG GCG TGC AAA 48 
Thr Glu Val Asp Gly Val Arg Leu His Arg Tyr Ala Pro Ala Cys Lys 
.5 10 15 

CCT CTC CTA CGG GAG GAG GTC ACA TTC CAG GTC GGG CTC AAC CAA TAC 96 
Pro Leu Leu Arg Glu Glu Val Thr Phe Gin Val Gly Leu Asn Gin Tyr 
20 25 30 



CTG GTT GGG TCG CAG CTC CC^TGC GAG CCC GAA CCG GAT d9& GCA GTG 144 

Leu Val Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val Ala Val 
35 40 45 

CTC ACT TCC ATG CTC ACC GAC CCC TCC CAC ATC ACA GCA GAG ACG GCT 192 
Leu Thr Ser Met Leu Thr Asp Pro Ser His lie Thr Ala Glu Thr Ala 
50 55 60 

AAG CGC AGG CTG GCC AGG GGG TCT CCC CCC TCC TTG GCC AGC TCT TCA 240 
Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser 
65 70 75 80 

GCT AGC CAG TTG TCT GGC CCT TCC TCG AAG GCG ACA TAC ATT ACC CAA 288 
Ala Ser Gin Leu Ser Gly Pro Ser Ser Lys Ala Thr Tyr lie Thr Gin 

85 90 95 

AAT GAC TTC CCA GAC GCT GAC CTC ATC GAG GCC AAC CTC CTG TGG CGG 336 
Asn Asp Phe Pro Asp Ala Asp Leu lie Glu Ala Asn Leu Leu Trp Arg 
100 105 110 

CAT GAG ATG GGC GGG GAC ATT ACC CGC GTG GAG TCA GAG AAC AAG GTA 384 
His Glu Met Gly Gly Asp lie Thr Arg Val Glu Ser Glu Asn Lys Val 
115 120 125 

GTA ATC CTG GAC TCT TTC GAC CCG CTC CGA GCG GAG GAG GAT GAG CGG 432 
Val lie Leu Asp Ser Phe Asp Pro Leu Arg Ala Glu Glu Asp Glu Arg 
130 135 140 

GAA GTG TCC GTC CCG GCG GAG ATC CTG CGG AAA TCC AAG AAA TTC CCA 480 
Glu Val Ser Val Pro Ala Glu lie Leu Arg Lys Ser Lys Lys Phe Pro 
145 150 155 160 

CCA GCG ATG CCC GCA TGG GCA CGC CCG GAT TAC AAC CCT CCG CTG CTG 528 
Pro Ala Met Pro Ala Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu 

165 170 175 

GAG TCC TGG AAG GCC CCG GAC TAC GTC CCT CCA GTG GTA CAT GGG TGC 576 
Glu Ser Trp Lys Ala Pro Asp Tyr Val Pro Pro Val Val His Gly Cys 
180 185 190 

CCA CTG CCA CCT ACT AAG ACC CCT CCT ATA CCA CCT CCA CGG AGA AAG 624 
Pro Leu Pro Pro Thr Lys Thr Pro Pro lie Pro Pro Pro Arg Arg Lys 
195 200 205 

AGG ACA GTT GTT CTG ACA GAA TCC ACC GTG TCT TCT GCC CTG GCG GAG 672 
Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser Ser Ala Leu Ala Glu 
210 215 220 

CTT GCC ACA AAG GCT TTT GGT AGC TCC GGA CCG TCG GCC GTC GAC AGC 720 
Leu Ala Thr Lys Ala Phe Gly Ser Ser Gly Pro Ser Ala Val Asp Ser 
225 230 235 240 



GGC ACG GCA ACC GCC CCT CC^GAC CAA TCC TCC GAC GAC G^ GGA GCA 

Gly Thr Ala Thr' Ala "Pro Pro Asp Gin Ser Ser Asp Asp Gly Gly Ala 

245 250 255 



768 



GGA TCT GAC GTT GAG TCG TAT TCC TCC ATG CCC CCC CTT GAG GGG GAG 816 
Gly Ser Asp Val Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu 
260 265 270 

CCG GGG GAC CCC GAT CTC AGC GAC GGG TCT TGG TCT ACC GTG AGT GAG 864 
Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser Glu 
275 280 285 

GAG GCC GGT GAG GAC GTC GTC TGC TGC TCG ATG TCC TAC ACA TGG ACA 912 
Glu Ala Gly Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp Thr 
290 295 300 

GGC GCT CTG ATC ACG CCA TGC GCT GCG GAG GAA AGC AAG CTG CCC ATC 960 
Gly Ala Leu lie Thr Pro Cys Ala Ala Glu Glu Ser Lys Leu Pro lie 
305 310 315 320 

AAC GCG TTG AGC AAC TCT TTG CTG CGT CAC CAC AAC ATG GTC TAC GCT 1008 
Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Met Val Tyr Ala 

325 330 335 

ACC ACA TCC CGC AGC GCA AGC CAG CGG 1035 
Thr Thr Ser Arg Ser Ala Ser Gin Arg 
340 345 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to genomic RNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: clone BR11 from cDNA library in lambda gtll 

(ix) FEATURE: 

(B) LOCATION: from 1 to 834 bp portion of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION: probably encodes viral structural 
proteins 



« (xi) SEQUENCE DESCRIP^In: SEQ ID N0:5: 

AGA AAA ACC AAA CGT AAC ACC AAC CTC CGC CCA CAG GAC GTC AGG TTC 
Arg Lys Thr Lys Arg Asn Thr Asn Leu Arg Pro Gin Asp Val Arg Phe 

5 10 15 



48 



CCG GGC GGT GGT CAG ATC GTT GGT GGA GTT TAC CTG TTG CCG CGC AGG 96 
Pro Gly Gly Gly Gin lie Val Gly Gly Val Tyr Leu Leu Pro Arg Arg 
20 25 30 

GGC CCC AGG TTG GGT GTG CGC GCG ACT AGG AAG ACT TCC GAG CGG TCG 144 
Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser 
35 40 45 

CAA CCT CGT GGA AGG CGA CAA CCT ATC CCC AAG GCT CGC CAG CCC GAG 192 
Gin Pro Arg Gly Arg Arg Gin Pro lie Pro Lys Ala Arg Gin Pro Glu 
50 55 60 

GGC AGG GCC TGG GCT CAG CCC GGG TAC CCT TGG CCC CTC TAT GGC AAC 240 
Gly Arg Ala Trp Ala Gin Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn 
65 70 75 80 

GAG GGC ATG GGG TGG GCA GGA TGG CTC CTG TCA CCC CGT GGC TCC CGG 288 
Glu Gly Met Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg 

85 90 95 

CCT AGT TGG GGC CCC ACT GAC CCC CGG CGT AGG TCG CGT AAT TTG GGT 336 
Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly 
100 105 110 

AAA GTC ATC GAT ACC CTC ACA TGC GGC TTC GCC GAC TCT CAT GGG GTA 384 
Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Ser His Gly Val 
115 120 125 

CAT TCC GCT CGT CGG CGC TCC CTT AGG GGC GCT GCC AGG GCC CTG GCG 432 
His Ser Ala Arg Arg Arg Ser Leu Arg Gly Ala Ala Arg Ala Leu Ala 
130 135 140 

CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA ACA GGG AAT 480 
His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn 
145 150 155 160 

TTA CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG CTG TCC TGT 528 
Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 

165 170 175 

TTG ACC ATT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG ATC 576 
Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly lie 
180 185 190 

TAC CAT GTC ACG AAC GAT TGC TCC AAC TCA AGC ATC GTG TAC GAG ACA 624 
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Thr 
195 200 205 



GCG GAC ATG ATC ATG CAC AC^CCC GGG TGT GTG CCC TGT G^CGG GAG 

Ala Asp Met lie' Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
210 215 220 

GGT AAT TCC TCC CGC TGC TGG GTA GCG CTC ACT CCC ACG CTC GCG GCC 
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
225 230 235 240 



672 



720 



AAG GAC GCC AGC ATC CCC ACT GCG ACA ATA CGA CGC CAC GTC GAT TTG 
Lys Asp Ala Ser lie Pro Thr Ala Thr lie Arg Arg His Val Asp Leu 

245 250 255 



768 



CTC GTT GGG GCG GCT GCC TTC TCG TCC GCT ATG TAC GTG GGG GAT CTC 
Leu Val Gly Ala Ala Ala Phe Ser Ser Ala Met Tyr Val Gly Asp Leu 
260 265 270 



816 



TGC GGA TCT GTT TTC CCG 
Cys Gly Ser Val Phe Pro 
275 



834 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

( B ) TYPE : nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda gtll 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Oligonucleotide synthesizer; oligo d75 

( ix ) FEATURE : 

(B) LOCATION: from 4 to 9 bases BamHl site, from 10 to 31 bases 
homologous to upstream portion of lacZ gene flanking the 
EcoRl site in bacteriophage lambda gtll from 26 to 31 bases 
EcoRl site 

(D) OTHER INFORMATION: primes DNA synthesis from the phage 

vector into cDNA inserted at the EcoRl site and introduces 
a BamHl site suitable for subsequent cloning into 
expression vectors. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



TAAGGATCCC CCGTCAGTAT CGGCGGAATT C 



31 



(2) INFORMATION FOR SEQ I 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda gtll 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Oligonucleotide synthesizer; oligo d76 

(ix) FEATURE: 

(B) LOCATION: from 4 to 9 bases BamHl site from 10 to 30 bases 
homologous to downstream portion of lacZ gene flanking the 
EcoRl site in bacteriophage lambda gtll 

(D) OTHER INFORMATION: primes DNA synthesis from the phage 

vector into cDNA inserted at the EcoRl site and introduces 
a BamHl site suitable for subsequent cloning into 
expression vectors. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TATGGATCCG TAGCGACCGG CGCTCAGCTG 30 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 

( b ) TYPE : nuc leo tide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo d94 

( ix ) FEATURE : 

(B) LOCATION: from 1 to 19 bases homologous to bases 914 to 
932 of the sense strand of JG2 (SEQ ID NO: 3) 

(D) OTHER INFORMATION: primes DNA synthesis on the negative 
strand of PT-NANBH genomic RNA/DNA. 



(xi) SEQUENCE DESCRIPTTON: SEQ ID NO: 8: 
ATGGGGCAAA GGACGTCCG 



19 



(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo d95 

( ix ) FEATURE : 

(B) LOCATION: from 1 to 24 bases homologous to bases 1620 to 
1643 of the anti-sense strand of JG2 (SEQ ID NO: 3) 

(D) OTHER INFORMATION: primes DNA synthesis on the positive 
strand of PT-NANBH genomic RNA/DNA. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TACCTAGTCA TAGCCTCCGT GAAG 24 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo Nl 

(ix) FEATURE: 

(B) LOCATION: from 1 to 17 bases homologous to bases 1033 to 
1049 of the sense strand of JG2 (SEQ ID NO: 3) 

(D) OTHER INFORMATION : primes DNA synthesis on the negative 
strand of PT-NANBH genomic RNA/DNA. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GAGGTTTTCT GCGTCCA 



17 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo N2 

(ix) FEATURE: 

(B) LOCATION: from 1 to 17 bases homologous to bases 1421 to 
1437 of the anti-sense strand of JG2 (SEQ ID NO: 3) 

(D) OTHER INFORMATION: primes DNA synthesis on the positive 
strand of PT-NANBH genomic RNA/DNA. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCGATAGCCG CAGTTCT 17 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo dl64 

(ix) FEATURE: 

(B) LOCATION: from 1 to 22 bases homologous to bases 10 to 
31 of the sequence in Fig. 2 of Okamoto et al., Japan. J. Exp. Med., 
1990, 60 167-177, base 22 changed from A to T to introduce Bgl2 
recognition site from 8 to 13 bases Bgl2 recognition site 



(D) OTHER INFORMAL©.., primes DNA synthesis o£ a negative 
strand of PT-NANBH genomic RNA/DNA and introduces a Bgl2 
site. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCACCATAGA TCTCTCCCCT GT 22 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo dl37 

(ix) FEATURE: 

(B) LOCATION: from 1 to 30 bases homologous to bases 154 to 
183 of the negative strand of BR11 (SEQ ID NO: 5) bases 174, 
177 and 178 modified to introduce an EcoRl recognition site 
from 5 to 10 bases EcoRl recognition site 

(D) OTHER INFORMATION: primes DNA synthesis on the positive 
strand of PT-NANBH genomic RNA/DNA and 
introduces an EcoRl site for cloning. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GCGAGAATTC GGGATAGGTT GTCGCCTTCC 30 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 



(vii) IMMEDIATE SOURCE© 

(A) LIBRARY:* oligonucleotide synthesizer; oligo dl36 

(ix) FEATURE: - 

(B) LOCATION: from 1 to 27 bases homologous to bases 672 to 
698 of the positive strand of BR11 (SEQ ID N0:5) base 675 
changed to G to introduce an EcoRl recognition site 

from 4 to 9 bases EcoRl recognition site 
(D) OTHER INFORMATION: primes DNA synthesis on the negative 

strand of PT-NANBH genomic RNA/DNA and introduces an EcoRl 
site for cloning. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGGGAATTCC TCCCGCTGCT GGGTAGC 27 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: chimpanzee; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo dl55 

(ix) FEATURE: 

(B) LOCATION: from 1 to 28 bases homologous to bases 462 to 
489 of the negative strand of figure 47, European Patent 
Application 88310922.5; bases 483 and 485 changed to introduce an 
EcoRl recognition site from 5 to 10 bases EcoRl recognition site 

(D) OTHER INFORMATION: primes DNA synthesis on the positive 
strand of PT-NANBH genomic RNA/DNA and introduces an EcoRl 
site for cloning. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ACGGGAATTC GACCAGGCAC CTGGGTGT 28 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE:^ Jietic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: chimpanzee; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo dl56 

(ix) FEATURE: 

(B) LOCATION: from 1 to 23 bases homologous to bases 3315 to 
3337 of the positive strand of figure 47, European Patent 
Application 88310922.5; base 3323 changed to C to introduce an 
EcoRl recognition site from 4 to 9 bases EcoRl recognition site 

(D) OTHER INFORMATION: primes DNA synthesis on the negative 

strand of PT-NANBH genomic RNA/DNA and introduces an EcoRl 
site for cloning. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTTGAATTCT GGGAGGGCGT CTT 23 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo d92 

(ix) FEATURE: 

(B) LOCATION: from 1 to 29 bases homologous to bases 36 to 

64 of the negative strand of JG2 (SEQ ID NO:3); bases 57, 
58 and 60 changed to introduce an EcoRl recognition site 
from 5 to 10 bases EcoRl recognition site 
(D) OTHER INFORMATION: primes DNA synthesis on the positive 

strand of PT-NANBH genomic RNA/DNA and introduces an EcoRl 
site for cloning. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CGCCGAATTC ATGCCGCCAC AGGAGGTTG 



29 



j(2) INFORMATION FOR SEQ il 




':18: 




(ii) MOLECULE TYPE: cDNA to genomic RNA 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 




(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: clone 164/137 

(ix) FEATURE: 

(B) LOCATION: from 308 to 504 bp start of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION: probably encodes viral structural 
proteins 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GATCACTCCC CTGTGAGGAA CTACTGTCTT CACGCAGAAA GCGTCTAGCC ATGGCGTTAG 

TATGAGTGTC GTGCAGCCTC CAGGACCCCC CCTCCCGGGA GAGCCATAGT GGTCTGCGGA 

ACCGGTGAGT ACACCGGAAT TGCCAGGACG ACCGGGTCCT TTCTTGGATT AACCCGCTCA 

ATGCCTGGAG ATTTGGGCGT GCCCCCGCAA GACTGCTAGC CGAGTAGTGT TGGGTCGCGA 

AAGGCCTTGT GGTACTGCCT GATAGGGTGC TTGCGAGTGC . CCCGGGAGGT CTCGTAGACC 
GTGCACC ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn 

5 10 

ACC AAC CGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC 
Thr Asn Pro Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He 
15 20 25 30 

GTT GGT GGA GTT TAC CTG TTG CCG CGC AGG GGC CCC AGG TTG GGT GTG 
Val Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 

35 40 45 

CGC GCG ACT AGG AAG ACT TCC GAG CGG TCG CAA CCT CGT GGA AGG CGA 
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Aro 
50 55 60 

CAA CCT ATC CC 
Gin Pro He Pro 
65 



60 

120 

180 

240 

300 
349 

397 

445 

493 

504 



{2) INFORMATION FOR SEQ l£ >:19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to genomic RNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: clone 136/155 

(ix) FEATURE: 

(B) LOCATION: from 1 to 1107 bp portion of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION: probably encodes viral structural 
proteins 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TCC TCC CGC TGC TGG GTA GCG CTC ACT CCC ACG CTC GCG GCC AAG GAC 48 
Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Lys Asp 

5 10 15 

GCC AGC ATC CCC ACT GCG ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT 96 
Ala Ser lie Pro Thr Ala Thr He Arg Arg His Val Asp Leu Leu Val 
20 25 30 

GGG GCG GCT GCC TTC TGC TCC GCT ATG TAC GTG GGG GAT CTC TGC GGA 144 
Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu Cys Gly 
35 40 45 

TCT GTT TTC CTC GTC TCT CAG CTG TTC ACC TTC TCG CCT CGC CGA CAT 192 
Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg Arg His 
50 55 60 

CAG ACG GTA CAG GAC TGC AAT TGT TCA ATC TAT CCC GGC CAC GTA TCA 240 
Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His Val Ser 
65 70 75 80 

GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCA CCT ACA GCA 288 
Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Ala 

85 90 95 



GCC CTA GTG GTA TCG CAG CTA CTC CGG ATC CCA CAA GCT GTC GTG GAC 
Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val Val Asp 
100 105 110 



336 



ATG GTG GCG GGG GCC CAC T(£ JGA GTC CTG GCG GGC CTT G^ JAC TAT 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
115 120 125 



384 



TCC ATG GTG GGG AAC TGG GCT AAG GTC TTG GTT GTG ATG CTA CTC TTT 
Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
130 135 140 



432 



GCC GGC GTT GAC GGG GAA CCT TAC ACG ACA GGG GGG ACA CAC GGC CGC 
Ala Gly Val Asp Gly Glu Pro Tyr Thr Thr Gly Gly Thr His Gly Arg 
145 150 155 160 



480 



GCC GCC CAC GGG CTT ACA TCC CTC TTC ACA CCT GGG CCG GCT GAG AAA 
Ala Ala His Gly Leu Thr Ser Leu Phe Thr Pro Gly Pro Ala Gin Lys 

165 170 175 



528 



ATC CAG CTT GTA AAC ACC AAC GGC AGC TGG CAC ATC AAC AGA ACT GCC 
lie Gin Leu Val Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala 
180 185 190 



576 



TTG AAC TGC AAT GAC TCC CTC CAA ACT GGG TTC CTT GCC GCG CTG TTC 
Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Leu Ala Ala Leu Phe 
195 200 205 



624 



TAC ACG CAC AGG TTC AAT GCG TCC GGA TGC TCA GAG CGC ATG GCC AGC 
Tyr Thr His Arg Phe Asn Ala Ser Gly Cys Ser Glu Arg Met Ala Ser 
210 215 220 



672 



TGC CGC CCC ATT GAC CAG TTC GAT CAG GGG TGG GGT CCC ATC ACT TAT 
Cys Arg Pro lie Asp Gin Phe Asp Gin Gly Trp Gly Pro lie Thr Tyr 
225 230 235 240 



720 



AAT GAG TCC CAC GGC TTG GAC CAG AGG CCC TAT TGC TGG CAC TAC GCA 
Asn Glu Ser His Gly Leu Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 

245 250 255 



768 



CCT CAA CCG TGT GGT ATC GTG CCC GCG TTG CAG GTG TGT GGC CCA GTG 
Pro Gin Pro Cys Gly lie Val Pro Ala Leu Gin Val Cys Gly Pro Val 
260 265 270 



816 



TAC TGT TTC ACT CCA AGC CCT GTT GTG GTG GGG ACG ACC GAT CGT TTC 
Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
275 280 285 



864 



GGC GCC CCT ACG TAC AGA TGG GGT GAG AAT GAG ACG GAC GTG CTG CTT 
Gly Ala Pro Thr Tyr Arg Trp Gly Glu Asn Glu Thr Asp Val Leu Leu 
290 295 300 



912 



CTC AAC AAC ACG CGG CCG CCA CGG GGC AAC TGG TTC GGC TGT ACA TGG 
Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
305 310 315 320 



960 



ATG AAT AGC 
Met Asn Ser 



ATC GGG GGG 
He Gly Gly 



CGG AAG CAT 
Arg Lys His 
355 

TTG 
Leu 



ACC GGG TTC 
Thr .Gly ?he 
325 

GTC GGC AAC 
Val Gly Asn 
340 

CCC GAG GCC 
Pro Glu Ala 



A(A AG ACG 

Thr j_jys Thr 



AAC ACT TTG 
Asn Thr Leu 
345 

ACT TAC ACC 
Thr Tyr Thr 
360 



TGT GGG GGC 
Cys Gly Gly 
330 

ATC TGC CCC 
He Cys Pro 



AAA TGC GGT 
Lys Cys Gly 



CCC ?GC 
Pro Pro Cys 
335 

ACG GAC TGC 
Thr Asp Cys 
350 

TCG GGG CCT 
Ser Gly Pro 
365 



AAC 1008 
Asn 



TTC 1056 
Phe 



TGG 1104 
Trp 



1107 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2043 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to genomic RNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: clone 156/92 

(ix) FEATURE: 

(B) LOCATION: from 1 to 2043 bp portion of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION: probably encodes viral non-structural 
proteins 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TGG GAG GGC GTC TTC ACA GGC CTC ACC CAC GTG GAT GCC CAC TTC CTG 48 
Trp Glu Gly Val Phe Thr Gly Leu Thr His Val Asp Ala His Phe Leu 

5 10 15 

TCC CAA ACA AAG CAG GCA GGA GAC AAC TTC CCC TAC CTG GTG GCG TAC 96 
Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr Leu Val Ala Tyr 
20 25 30 

CAG GCT ACT GTG TGC GCT AGG GCC CAG GCC CCA CCT CCA TCA TGG GAT 144 
Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro Pro Ser Trp Asp 
35 40 45 



CAA ATG TGG AAG TGT CTC a£ CGG CTA AAG CCT ACT CTG A 3GG CCA 192 

Gin Met Trp Lys. Cys Leu ire Arg Leu Lys Pro Thr Leu Arg Gly Pro 
50 55 60 

ACA CCC TTG CTG TAT AGG CTG GGA GCC GTC CAA AAC GAG GTC ACC CTC 240 
Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asn Glu Val Thr Leu 
65 70 75 80 

ACA CAC CCC ATA ACC AAA TTC ATC ATG GCA TGC ATG TCA GCC GAC CTG 288 
Thr His Pro He Thr Lys Phe He Met Ala Cys Met Ser Ala Asp Leu 

85 90 95 

GAG GTC GTC ACG AGC ACC TGG GTG CTG GTG GGC GGG GTC CTT GCA GCT 336 
Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala 
100 105 110 

CTG GCT GCG TAT TGC TTG ACA ACA GGC AGC GTG GTC ATT GTG GGT AGG 384 
Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val He Val Gly Arg 
115 120 125 

ATC ATC TTG TCC GGG CGG CCG GCT ATT GTT CCC GAC AGG GAA GTC CTC 432 
He He Leu Ser Gly Arg Pro Ala He Val Pro Asp Arg Glu Val Leu 
130 135 140 

TAC CAG GAG TTC GAT GAG ATG GAA GAG TGC GCG TCG CAC CTC CCT TAC 480 
Tyr Gin Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr 
145 150 155 160 

ATC GAG CAG GGA ATG CAG CTC GCC GAG CAG TTC AAG CAA AAA GCG CTC 528 
He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys Gin Lys Ala Leu 

165 170 175 

GGG TTG CTG CAG ACA GCC ACC AAG CAA GCG GAG. GCC GCT GCT CCC GTG 576 
Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala Ala Ala Pro Val 
180 185 190 

GTG GAG TCC AAG TGG CGA GCC CTT GAG ACC TTC TGG GCG AAA CAC ATG 624 
Val Glu Ser Lys Trp Arg Ala Leu Glu Thr Phe Trp Ala Lys His Met 
195 200 205 

TGG AAC TTC ATC AGC GGG ATA CAG TAC TTA GCA GGC TTG TCC ACT CTG 672 
Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly Leu Ser Thr Leu 
210 215 220 

CCT GGG AAT CCC GCG ATT GCA TCA CTG ATG GCG TTC ACA GCC TCT GTC 720 
Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala Phe Thr Ala Ser Val 
225 230 235 240 

ACT AGC CCG CTC ACC ACC CAA TCT ACC CTC CTG CTT AAC ATC CTG GGG 768 
Thr Ser Pro Leu Thr Thr Gin Ser Thr Leu Leu Leu Asn He Leu Gly 

245 250 255 



GGA TGG GTA GCC GCC CAA CA SCT CCC CCC AGT GCT GCT ^gfe GCT TTC 816 

Gly Trp Val Ala Ala Gin LeWAla Pro Pro Ser Ala Ala sW Ala Phe 
260' ' 265 270 

GTA GGC GCC GGC ATT GCT GGT GCG GCT GTT GGC AGC ATA GGC CTT GGG 864 
Val Gly Ala Gly lie Ala Gly Ala Ala Val Gly Ser lie Gly Leu Gly 
275 280 285 

AAG GTG CTT GTG GAC ATC TTG GCG GGC TAT GGA GCA GGA GTG GCA GGC 912 
Lys Val Leu Val Asp lie Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly 
290 295 300 

GCG CTC GTG GCC TTT AAG GTC ATG AGC GGC GAA ATG CCC TCC ACC GAG 960 
Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met Pro Ser Thr Glu 
305 310 315 320 

GAC CTG GTT AAC TTA CTC CCT GCC ATC CTC TCT CCT GGT GCC CTG GTC 1008 
Asp Leu Val Asn Leu Leu Pro Ala lie Leu Ser Pro Gly Ala Leu Val 

325 330 335 

GTC GGG GTC GTG TGC GCA GCG ATA CTG CGT CGG CAC GTG GGT CCA GGG 1056 
Val Gly Val Val Cys Ala Ala He Leu Arg Arg His Val Gly Pro Gly 
340 345 350 

GAG GGG GCT GTG CAG TGG ATG AAC CGG CTG ATA GCG TTC GCC TCG CGG 1104 
Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala Phe Ala Ser Arg 
355 360 365 

GGT AAC CAT GTT TCC CCC ACG CAC TAT GTG CCA GAG AGC GAC GCC GCA 1152 
Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala 
370 375 380 

GCA CGT GTC ACT CAG ATC CTC TCC GAC CTT ACT ATC ACC CAA CTG TTG 1200 
Ala Arg Val Thr Gin He Leu Ser Asp Leu Thr He Thr Gin Leu Leu 
385 390 395 400 

AAG AGG CTC CAC CAG TGG ATT AAC GAG GAC TGC TCC ACG CCC TGC TCC 1248 
Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser Thr Pro Cys Ser 

405 410 415 

GGC TCG TGG CTA AGG GAT GTT TGG GAC TGG ATA TGC ACA GTT TTG GCT 1296 
Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu Ala 
420 425 430 

GAC TTC AAG ACC TGG CTC CAG TCC AAG CTC CTG CCG CGA TTA CCG GGA 1344 
Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro Gly 
435 440 445 

GTC CCC TTT TTC TCA TGC CAA CGT GGG TAC AAG GGG GTC TGG CGG GGA 1392 
Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg Gly 
450 455 460 



GAC GGC ATC ATG CAG ACC aA.^GC TCA TGT GGA GCA CAG \CC GGA 1440 

Asp Gly He Met. Gin Thr Thx Cys Ser Cys Gly Ala Gin ire Thr Gly 
465 470 475 480 

CAT GTC AAA AAC GGT TCC ATG AGG ATC GTT GGG CCT AAG ACC TGT AGT 1488 
His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Lys Thr Cys Ser 

485 490 495 

AAC ATG TGG CAT GGA ACA TTC CCC ATC AAC GCA TAC ACC ACG GGC CCC 1536 
Asn Met Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly Pro 
500 505 510 

TGC ACG CCC TCC CCA GCG CCA AAC TAT TCC AGG GCG CTG TGG CGG GTG 1584 
Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg Val 
515 520 525 

GCT GCT GAG GAG TAC GTG GAG GTT ACG CGG GTG GGG GAT TTC CAC TAC 1632 
Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr 
530 535 540 

GTG ACG AGC. ATG ACC ACT GAC AAC GTA AAA TGC CCG TGC CAG GTT CCA 1680 
Val Thr Ser Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val Pro 
545 550 555 560 

GCC CCC GAA TTC TTC ACA GAA GTG GAT GGG GTG CGG CTG CAC AGG TAC 1728 
Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg Tyr 

565 570 575 

GCT CCG GCG TGC AAA CCT CTC CTA CGG GAG GAG GTC ACA TTC CAG GTC 1776 
Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Gin Val 
580 585 590 

GGG CTC AAC CAA TAC CTG GTT GGG TCG CAG CTC CCA TGC GAG CCC GAA 1824 
Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro Glu 
595 600 605 

CCG GAT GTA GCA GTG CTC ACT TCC ATG CTC ACC GAC CCC TCC CAC ATC 1872 
Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His He 
610 615 620 

ACA GCA GAG ACG GCT AAG CGC AGG CTG GCC AGG GGG TCT CCC CCC TCC 1920 
Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser 
625 630 635 640 

TTG GCC AGC TCT TCA GCT AGC CAG TTG TCT GCG CCT TCC TCG AAG GCG 1968 
Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Ser Lys Ala 

645 650 655 

ACA TAC ATT ACC CAA AAT GAC TTC CCA GAC GCT GAC CTC ATC GAG GCC 2016 
Thr Tyr He Thr Gin Asn Asp Phe Pro Asp Ala Asp Leu He Glu Ala 
660 665 670 



AAC CTC CTG TGG CGG CAT GA \TG GGC 
Asn Leu Leu Trp Arg His G^r Met Gly 

675 " ' 680 



2043 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2116 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to genomic RNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA clones from 5' end of the genome 

(ix) FEATURE: 

(B) LOCATION: from 308 to 2116 bp start of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION: viral structural and non-structural 
proteins 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

GATCACTCCC CTGTGAGGAA CTACTGTCTT CACGCAGAAA GCGTCTAGCC ATGGCGTTAG 60 

TATGAGTGTC GTGCAGCCTC CAGGACCCCC CCTCCCGGGA GAGCCATAGT GGTCTGCGGA 120 

ACCGGTGAGT ACACCGGAAT TGCCAGGACG ACCGGGTCCT TTCTTGGATT AACCCGCTCA 180 

ATGCCTGGAG ATTTGGGCGT GCCCCCGCAA GACTGCTAGC CGAGTAGTGT TGGGTCGCGA 240 

AAGGCCTTGT GGTACTGCCT GATAGGGTGC TTGCGAGTGC CCCGGGAGGT CTCGTAGACC 300 

GTGCACC ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC 349 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn 

5 10 



ACC AAC 


CGC 


CGC 


CCA 


CAG 


GAC 


GTC 


AAG 


TTC 


CCG 


GGC 


GGT 


GGT 


CAG 


ATC 


397 


Thr Asn 


Pro 


Arg 


Pro 


Gin 


Asp 


Val 


Lys 


Phe 


Pro 


Gly 


Gly 


Gly 


Gin 


He 




15 








20 










25 








30 




GTT GGT 


GGA 


GTT 


TAC 


CTG 


TTG 


CCG 


CGC 


AGG 


GGC 


CCC 


AGG 


TTG 


GGT 


GTG 


445 


Val Gly 


Gly 


Val 


Tyr 


Leu 


Leu 


Pro 


Arg 


Arg 


Gly 


Pro 


Arg 


Leu 


Gly 


Val 










35 










40 










45 






CGC GCG 


ACT 


AGG 


AAG 


ACT 


TCC 


GAG 


CGG 


TCG 


CAA 


CCT 


CGT 


GGA 


AGG 


CGA 


493 


Arg Ala 


Thr 


Arg 


Lys 


Thr 


Ser 


Glu 


Arg 


Ser 


Gin 


Pro 


Arg 


Gly 


Arg 


Arg 








50 










55 










60 









CAA CCT ATC CCC AAG GCT CgA'AG CCC GAG GGC AGG GCC T<A 1CT CAG 541 
Gin Pro lie Pro Lys Ala Arg^Gln Pro Glu Gly Arg Ala Tr^Ala Gin 
65 70 75 

CCC GGG TAC CCT TGG CCC CTC TAT GGC AAC GAG GGC ATG GGG TGG GCA 589 
Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala 
80 85 90 

GGA TGG CTC CTG TCA CCC CGT GGC TCC CGG CCT AGT TGG GGC CCC ACT 637 
Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 
100 105 110 115 

GAC CCC CGG CGT AGG TCG CGT AAT TTG GGT AAA GTC ATC GAT ACC CTC 685 
Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu 

120 125 130 

ACA TGC GGC TTC GCC GAC CTC ATG GGG TAC ATT CCG CTC GTC GGC GCT 733 
Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala 
135 140 145 

CCC TTA GGG GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG 781 
Pro Leu Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
150 155 160 

GAG GAC GGC GTG AAC TAT GCA ACA GGG AAT TTA CCC GGT TGC TCT TTC 829 
Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
165 170 175 

TCT ATC TTC CTC TTG GCT TTG CTG TCC TGT TTG ACC ATT CCA GCT TCC 877 
Ser lie Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr lie Pro Ala Ser 
180 185 190 195 

GCT TAT GAA GTG CGC AAC GTG TCC GGG ATC TAC CAT GTC ACG AAC GAT 925 
Ala Tyr Glu Val Arg Asn Val Ser Gly lie Tyr His Val Thr Asn Asp 

200 205 210 

TGC TCC AAC TCA AGC ATC GTG TAC GAG ACA GCG GAC ATG ATC ATG CAC 973 
Cys Ser Asn Ser Ser lie Val Tyr Glu Thr Ala Asp Met lie Met His 
215 220 225 

ACC CCC GGG TGT GTG CCC TGT GTC CGG GAG GGT AAT TCC TCC CGC TGC 1021 
Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ser Ser Arg Cys 
230 235 240 

TGG GTA GCG CTC ACT CCC ACG CTC GCG GCC AAG GAC GCC AGC ATC CCC 1069 
Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Lys Asp Ala Ser lie Pro 
245 250 255 

ACT GCG ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCC 1117 
Thr Ala Thr lie Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala 
260 265 270 275 



TTC TGC TCC GCT ATG TAC ^GG GAT CTC TGC GGA TCT (A TTC CTC 1165 

Phe Cys Ser Ala Met Tyr Va^ Gly Asp Leu Cys Gly Ser vff[ Phe Leu 
' 280 ' 285 290 

GTC TCT CAG CTG TTC ACC TTC TCG CCT CGC CGA CAT CAG ACG GTA CAG 1213 
Val Ser Gin Leu Phe Thr Phe Ser Pro Arg Arg His Gin Thr Val Gin 
295 300 305 

GAC TGC AAT TGT TCA ATC TAT CCC GGC CAC GTA TCA GGT CAC CGC ATG 1261 
Asp Cys Asn Cys Ser lie Tyr Pro Gly His Val Ser Gly His Arg Met 
310 315 320 

GCT TGG GAT ATG ATG ATG AAC TGG TCA CCT ACA GCA GCC CTA GTG GTA 1309 
Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val 
325 330 335 

TCG CAG CTA CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG 1357 
Ser Gin Leu Leu Arg lie Pro Gin Ala Val Val Asp Met Val Ala Gly 
340 345 350 355 

GCC CAC TGG GGA GTC CTG GCG GGC CTT GCC TAC TAT TCC ATG GTG GGG 1405 
Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly 

360 365 370 

AAC TGG GCT AAG GTC TTG GTT GTG ATG CTA CTC TTT GCC GGC GTT GAC 1453 
Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp 
375 380 385 

GGG GAA CCT TAC ACG ACA GGG GGG ACA CAC GGC CGC GCC GCC CAC GGG 1501 
Gly Glu Pro Tyr Thr Thr Gly Gly Thr His Gly Arg Ala Ala His Gly 
390 395 400 

CTT ACA TCC CTC TTC ACA CCT GGG CCG GCT CAG AAA ATC CAG CTT GTA 1549 
Leu Thr Ser Leu Phe Thr Pro Gly Pro Ala Gin Lys lie Gin Leu Val 
405 410 415 

AAC ACC AAC GGC AGC TGG CAC ATC AAC AGA ACT GCC TTG AAC TGC AAT 1597 
Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn 
420 425 430 435 

GAC TCC CTC CAA ACT GGG TTC CTT GCC GCG CTG TTC TAC ACG CAC AGG 1645 
Asp Ser Leu Gin Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Arg 

440 445 450 

TTC AAT GCG TCC GGA TGC TCA GAG CGC ATG GCC AGC TGC CGC CCC ATT 1693 
Phe Asn Ala Ser Gly Cys Ser Glu Arg Met Ala Ser Cys Arg Pro lie 
455 460 ' 465 

GAC CAG TTC GAT CAG GGG TGG GGT CCC ATC ACT TAT AAT GAG TCC CAC 1741 
Asp Gin Phe Asp Gin Gly Trp Gly Pro lie Thr Tyr Asn Glu Ser His 
470 475 480 



GGC TTG GAC CAG AGG CCC TiA lGC TGG CAC TAC GCA CCT 3CG TGT 1789 

Gly Leu Asp Gin «Arg ?ro TylrCys Trp His Tyr Ala Pro GlW Pro Cys 
485 490 495 



GGT ATC GTG 
Gly He Val 
500 

CCA AGC CCT 
Pro Ser Pro 



TAC AGA TGG 
Tyr Arg Trp 



CGG CCG CCA 
Arg Pro Pro 
550 

GGG TTC ACC 
Gly Phe Thr 
565 

GGC AAC AAC 
Gly Asn Asn 
580 

GAG GCC ACT 
Glu Ala Thr 



CCC GCG TTG 
Pro Ala Leu 
505 

GTT GTG GTG 
Val Val Val 
520 

GGT GAG AAT 
Gly Glu Asn 
535 

CGG GGC AAC 
Arg Gly Asn 



AAG ACG TGT 
Lys Thr Cys 



ACT TTG ATC 
Thr Leu He 
585 

TAC ACC AAA 
Tyr Thr Lys 
600 



CAG GTG TGT 
Gin Val Cys 



GGG ACG ACC 
Gly Thr Thr 



GAG ACG GAC 
Glu Thr Asp 
540 

TGG TTC GGC 
Trp Phe Gly 
555 

GGG GGC CCC 
Gly Gly Pro 
570 

TGC CCC ACG 
Cys Pro Thr 



TGC GGT TCG 
Cys Gly Ser 



GGC CCA GTG 
Gly Pro Val 
510 

GAT CGT TTC 
Asp Arg Phe 
525 

GTG CTG CTT 
Val Leu Leu 



TGT ACA TGG 
Cys Thr Trp 



CCG TGC AAC 
Pro Cys Asn 
575 

GAC TGC TTC 
Asp Cys Phe 
590 

GGG CCT TGG 
Gly Pro Trp 
605 



TAC TGT TTC 
Tyr Cys Phe 



GGC GCC CCT 
Gly Ala Pro 
530 

CTC AAC AAC 
Leu Asn Asn 
545 

ATG AAT AGC 
Met Asn Ser 
560 

ATC GGG GGG 
He Gly Gly 



CGG AAG CAT 
Arg Lys His 



TTG 
Leu 



ACT 1837 

Thr 

515 

ACG 1885 
Thr 



ACG 1933 
Thr 



ACC 1981 
Thr 



GTC 2029 
Val 



CCC 2077 

Pro 

595 

2116 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3750 base pairs 

(B) TYPE: nucleotide with corresponding protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to genomic RNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human; serum infectious for PT-NANBH 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA clones from 3' end of the genome 

( ix ) FEATURE : 

(B) LOCATION: from 1 to 3750 bp portion of the PT-NANBH 
polyprotein 

(D) OTHER INFORMATION : viral non-structural proteins 



* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



TGG 
Trp 


GAG 
Glu 


GGC 
Gly 


GTC 
Val 


TTC 
Phe 
5 


ACA 
Thr 


GGC 
Gly 


CTC 
Leu 


ACC 
Thr 


CAC 
His 
10 


GTG 
Val 


GAT 
Asp 


GCC 
Ala 


CAC 
His 


TTC 
Phe 
15 


CTG 
Leu 


48 


TCC 
Ser 


CAA 
Gin 


ACA 
Thr 


AAG 
Lys 
20 


CAG 
Gin 


GCA 
Ala 


GGA 
Gly 


GAC 
Asp 


AAC 
Asn 
25 


TTC 
Phe 


CCC 
Pro 


TAC 
Tyr 


CTG 
Leu 


GTG 
Val 
30 


GCG 
Ala 


TAC 
Tyr 


96 


CAG 
Gin 


GCT 
Ala 


ACT 
Thr 
35 


GTG 
Val 


TGC 
Cys 


GCT 
Ala 


AGG 
Arg 


GCC 
Ala 
40 


CAG 
Gin 


GCC 
Ala 


CCA 
Pro 


CCT 
Pro 


CCA 
Pro 
45 


TCA 
Ser 


TGG 
Trp 


GAT 
Asp 


144 


CAA 
Gin 


ATG 
Met 
50 


TGG 
Trp 


AAG 
Lys 


TGT 
Cys 


CTC 
Leu 


ATA 
He 
55 


CGG 
Arg 


CTA 
Leu 


AAG 
Lys 


CCT 
Pro 


ACT 
Thr 
60 


CTG 
Leu 


CGC 
Arg 


GGG 
Gly 


CCA 
Pro 


192 


ACA 
Thr 
65 


CCC 
Pro 


TTG 
Leu 


CTG 
Leu 


TAT 
Tyr 


AGG 
Arg 
70 


CTG 
Leu 


GGA 
Gly 


GCC 
Ala 


GTC 
Val 


CAA 
Gin 
75 


AAC 
Asn 


GAG 
Glu 


GTC 
Val 


ACC 
Thr 


CTC 
Leu 
80 


240 


ACA 
Thr 

GAG 
Glu 


CAC 
His 

GTC 
Val 


CCC 
Pro 

GTC 
Val 


ATA 
He 

ACG 
Thr 
100 


ACC 
Thr 

oc 

OJ 

AGC 
Ser 


AAA 
Lys 

ACC 
Thr 


TTC 
Phe 

TGG 
Trp 


ATC 
lie 

GTG 
Val 


ATG 
Met 

CTG 
Leu 
105 


GCA 
Ala 

J u 

GTG 
Val 


TGC 
Cys 

GGC 
Gly 


ATG 
Met 

GGG 
Gly 


TCA 
Ser 

GTC 
Val 


GCC 
Ala 

CTT 
Leu 
110 


GAC 
Asp 
y d 
GCA 
Ala 


CTG 
Leu 

GCT 
Ala 


288 
336 


CTG 
Leu 


GCT 
Ala 


GCG 
Ala 
115 


TAT 
Tyr 


TGC 
Cys 


TTG 
Leu 


ACA 
Thr 


ACA 
Thr 
120 


GGC 
Gly 


AGC 
Ser 


GTG 
Val 


GTC 
Val 


ATT 
He 
125 


GTG 
Val 


GGT 
Gly 


AGG 
Arg 


384 


ATC 
He 


ATC 
He 
130 


TTG 
Leu 


TCC 
Ser 


GGG 
Gly 


CGG 
Arg 


CCG 
Pro 
135 


GCT 
Ala 


ATT 
He 


GTT 
Val 


CCC 
Pro 


GAC 
Asp 
140 


AGG 
Arg 


GAA 
Glu 


GTC 
Val 


CTC 
Leu 


432 


TAC 
Tyr 
145 


CAG 
Gin 


GAG 
Glu 


TTC 
Phe 


GAT 
Asp 


GAG 
Glu 
150 


ATG 
Met 


GAA 
Glu 


GAG 
Glu 


TGC 
Cys 


GCG 
Ala 
155 


TCG 
Ser 


CAC 
His 


CTC 
Leu 


CCT 
Pro 


TAC 
Tyr 
160 


480 


ATC 
He 


GAG 
Glu 


CAG 
Gin 


GGA 
Gly 


ATG 
Met 
165 


CAG 
Gin 


CTC 
Leu 


GCC 
Ala 


GAG 
Glu 


CAG 
Gin 
170 


TTC 
Phe 


AAG 
Lys 


CAA 
Gin 


AAA 
Lys 


GCG 
Ala 
175 


CTC 
Leu 


528 


GGG 
Gly 


TTG 
Leu 


CTG 
Leu 


CAG 
Gin 
180 


ACA 
Thr 


GCC 
Ala 


ACC 
Thr 


AAG 
Lys 


CAA 
Gin 
185 


GCG 
Ala 


GAG 
Glu 


GCC 
Ala 


GCT 
Ala 


GCT 
Ala 
190 


CCC 
Pro 


GTG 
Val 


576 


GTG 


GAG 


TCC 


AAG 


TGG 


CGA 


GCC 


CTT 


GAG 


ACC 


TTC 


TGG 


GCG 


AAA 


CAC 


ATG 


624 



Val Glu Ser Lys Trp Arg Ala Leu Glu Thr Phe Trp Ala Lys His Met 
195 200 205 



TGG AAC TTC ATC AGC GGG ATJ^AG TAC TTA GCA GGC TTG TCJ^\CT CTG 672 
Trp Asn Phe lie Ser Gly IjJRln Tyr Leu Ala Gly Leu SeWrhr Leu 
210 " * 215 220 

CCT GGG AAT CCC GCG ATT GCA TCA CTG ATG GCG TTC ACA GCC TCT GTC 720 
Pro Gly Asn Pro Ala lie Ala Ser Leu Met Ala Phe Thr Ala Ser Val 
225 230 235 240 

ACT AGC CCG CTC ACC ACC CAA TCT ACC CTC CTG CTT AAC ATC CTG GGG 768 
Thr Ser Pro Leu Thr Thr Gin Ser Thr Leu Leu Leu Asn lie Leu Gly 

245 250 255 

GGA TGG GTA GCC GCC CAA CTC GCT CCC CCC AGT GCT GCT TCA GCT TTC 816 
Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala Ala Ser Ala Phe 
260 265 270 

GTA GGC GCC GGC ATT GCT GGT GCG GCT GTT GGC AGC ATA GGC CTT GGG 864 
Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser lie Gly Leu Gly 
275 280 285 

AAG GTG CTT GTG GAC ATC TTG GCG GGC TAT GGA GCA GGA GTG GCA GGC 912 
Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly 
290 295 300 

GCG CTC GTG GCC TTT AAG- GTC ATG AGC GGC GAA ATG CCC TCC ACC GAG 960 
Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met Pro Ser Thr Glu 
305 310 315 320 

GAC CTG GTT AAC TTA CTC CCT GCC ATC CTC TCT CCT GGT GCC CTG GTC 1008 
Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro Gly Ala Leu Val 

325 330 335 

GTC GGG GTC GTG TGC GCA GCG ATA CTG CGT CGG CAC GTG GGT CCA GGG 1056 
Val Gly Val Val Cys Ala Ala He Leu Arg Arg His Val Gly Pro Gly 
340 345 350 

GAG GGG GCT GTG CAG TGG ATG AAC CGG CTG ATA GCG TTC GCC TCG CGG 1104 
Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala Phe Ala Ser Arg 
355 360 365 

GGT AAC CAT GTT TCC CCC ACG CAC TAT GTG CCA GAG AGC GAC GCC GCA 1152 
Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala 
370 375 380 

GCA CGT GTC ACT CAG ATC CTC TCC GAC CTT ACT ATC ACC CAA CTG TTG 1200 
Ala Arg Val Thr Gin He Leu Ser Asp Leu Thr He Thr Gin Leu Leu 
385 390 395 400 

AAG AGG CTC CAC CAG TGG ATT AAC GAG GAC TGC TCC ACG CCC TGC TCC 1248 
Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser Thr Pro Cys Ser 

405 410 415 



t 



GGC TCG TGG CTA AGG GAT Glft-GG GAC TGG ATA TGC ACA GtWx'TG GCT 1296 
Gly Ser Trp Leu Arg Asp Val Trp Asp Trp lie Cys Thr Val Leu Ala 
420 425 430 

GAC TTC AAG ACC TGG CTC CAG TCC AAG CTC CTG CCG CGA TTA CCG GGA 1344 
Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro Gly 
435 440 445 

GTC CCC TTT TTC TCA TGC CAA CGT GGG TAC AAG GGG GTC TGG CGG GGA 1392 
Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg Gly 
450 455 460 

GAC GGC ATC ATG CAG ACC ACC TGC TCA TGT GGA GCA CAG ATC ACC GGA 1440 
Asp Gly lie Met Gin Thr Thr Cys Ser Cys Gly Ala Gin lie Thr Gly 
465 470 475 480 

CAT GTC AAA AAC GGT TCC ATG AGG ATC GTT GGG CCT AAG ACC TGT AGT 1488 
His Val Lys Asn Gly Ser Met Arg lie Val Gly Pro Lys Thr Cys Ser 

485 490 495 

AAC ATG TGG CAT GGA ACA TTC CCC ATC AAC GCA TAC ACC ACG GGC CCC 1536 
Asn Met Trp His Gly Thr Phe Pro lie Asn Ala Tyr Thr Thr Gly Pro 
500 505 510 

TGC ACG CCC TCC CCA GCG CCA AAC TAT TCC AGG GCG CTG TGG CGG GTG 1584 
Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg Val 
515 520 525 

GCT GCT GAG GAG TAC GTG GAG GTT ACG CGG GTG GGG GAT TTC CAC TAC 1632 
Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His Tyr 
530 535 540 

GTG ACG AGC ATG ACC ACT GAC AAC GTA AAA TGC CCG TGC CAG GTT CCA 1680 
Val Thr Ser Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val Pro 
545 550 555 560 

GCC CCC GAA TTC TTC ACA GAA GTG GAT GGG GTG CGG CTG CAC AGG TAC 1728 
Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg Tyr 

565 570 575 

GCT CCG GCG TGC AAA CCT CTC CTA CGG GAG GAG GTC ACA TTC CAG GTC 1776 
Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Gin Val 
580 585 590 

3GG CTC AAC CAA TAC CTG GTT GGG TCG CAG CTC CCA TGC GAG CCC GAA 1824 
Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro Glu 
595 600 605 

CCG GAT GTA GCA GTG CTC ACT TCC ATG CTC ACC GAC CCC TCC CAC ATC 1872 
Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His He 
610 615 620 



ACA GCA GAG ACG GCT AAG CG^\GG CTG GCC AGG GGG TCT CO^~CC TCC 1920 
Thr Ala Glu Thr Ala Lys Arg^Lrg Leu Ala Arg Gly Ser PrWPro Ser 
625 630 635 640 

TTG GCC AGC TCT TCA GCT AGC CAG TTG TCT GCG CCT TCC TCG AAG GCG 1968 
Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Ser Lys Ala 

645 650 655 

ACA TAC ATT ACC CAA AAT GAC TTC CCA GAC GCT GAC CTC ATC GAG GCC 2016 
Thr Tyr lie Thr Gin Asn Asp Phe Pro Asp Ala Asp Leu lie Glu Ala 
660 665 670 

AAC CTC CTG TGG CGG CAT GAG ATG GGC GGG GAC ATT ACC CGC GTG GAG 2064 
Asn Leu Leu Trp Arg His Glu Met Gly Gly Asp lie Thr Arg Val Glu 
675 680 685 

TCA GAG AAC AAG GTA GTA ATC CTG GAC TCT TTC GAC CCG CTC CGA GCG 2112 
Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Asp Pro Leu Arg Ala 
690 695 700 

GAG GAG GAT GAG CGG GAA GTG TCC GTC CCG GCG GAG ATC CTG CGG AAA 2160 
Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu lie Leu Arg Lys 
705 710 715 720 

TCC AAG AAA TTC CCA CCA GCG ATG CCC GCA TGG GCA CGC CCG GAT TAC 2208 
Ser Lys Lys Phe Pro Pro Ala Met Pro Ala Trp Ala Arg Pro Asp Tyr 

725 730 735 

AAC CCT CCG CTG CTG GAG TCC TGG AAG GCC CCG GAC TAC GTC CCT CCA 2256 
Asn Pro Pro Leu Leu Glu Ser Trp Lys Ala Pro Asp Tyr Val Pro Pro 
740 745 750 

GTG GTA CAT GGG TGC CCA CTG CCA CCT ACT AAG ACC CCT CCT ATA CCA 2304 
Val Val His. Gly Cys Pro Leu Pro Pro Thr Lys Thr Pro Pro lie Pro 
755 760 765 

CCT CCA CGG AGG AAG AGG ACA GTT GTT CTG ACA GAA TCC ACC GTG TCT 2352 
Pro Pro Arg Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr Val Ser 
770 775 780 

TCT GCC CTG GCG GAG CTT GCC ACA AAG GCT TTC GGT AGC TCC GAA CCG 2400 
Ser Ala Leu Ala Glu Leu Ala Thr Lys Ala Phe Gly Ser Ser Glu Pro 
785 790 795 800 

TCG GCC GTC GAC AGC GGC ACG GCA ACC GCC CCT CCT GAC CAA CCC TCC 2448 
Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Pro Pro Asp Gin Pro Ser 

805 810 815 

GAC GAC GGC GGA GCA GGA TCT GAC GTT GAG TCG TAT TCC TCC ATG CCC 2496 
Asp Asp Gly Gly Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met Pro 
820 825 830 



CCC CTT GAG GGG GAG.CCG GG^^AC CCC GAT CTC AGC GAC GA XT TGG 2544 
Pro Leu Glu Gly Glu Pro Gl^Risp Pro Asp Leu Ser Asp Gl^Ser Trp 

835 * * 840 845 

TCT ACC GTG AGT GAG GAG GCC GGT GAG GAC GTC GTC TGC TGC TCG ATG 2592 
Ser Thr Val Ser Glu Glu Ala Gly Glu Asp Val Val Cys Cys Ser Met 
850 855 860 

TCC TAC ACA TGG ACA GGC GCT CTG ATC ACG CCA TGC GCT GCG GAG GAA 2640 
Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro Cys Ala Ala Glu Glu 
865 870 875 . 880 

AGC AAG CTG CCC ATC AAC GCG TTG AGC AAC TCT TTG CTG CGT CAC CAC 2688 
Ser Lys Leu Pro lie Asn Ala Leu Ser Asn Ser Leu Leu Arg His His 

885 890 895 

AAC ATG GTC TAC GCT ACC ACA TCC CGC AGC GCA AGC CAG CGG CAG AAG 2736 
Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin Arg Gin Lys 
900 905 910 

AAG GTC ACC TTT GAC AGA CTG CAA ATC CTG GAC GAT CAC TAC CAG GAC 2784 
Lys Val Thr Phe Asp Arg Leu Gin lie Leu Asp Asp His Tyr Gin Asp 
915 920 925 

GTG CTC AAG GAG ATG AAG GCG AAG GCG TCC ACA GTT AAG GCT AAG CTT 2832 
Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val Lys Ala Lys Leu 
930 935 940 

CTA TCA GTA GAG GAA GCC TGC AAG CTG ACG CCC CCA CAT TCG GCC AAA 2880 
Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His Ser Ala Lys 
945 950 955 960 

TCT AAA TTT GGC TAT GGG GCA AAG GAC GTC CGG AAC CTA TCC AGC AAG 2928 
Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Ser Ser Lys 

965 970 975 

GCC ATT AAC CAC ATC CGC TCC GTG TGG GAG GAC TTG TTG GAA GAC ACT 2976 
Ala lie Asn His lie Arg Ser Val Trp Glu Asp Leu Leu Glu Asp Thr 
980 985 990 

GAA ACA CCA ATT GAC ACC ACC ATC ATG GCA AAA AAT GAG GTT TTC TGC 3024 
Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys 
995 1000 1005 

GTC CAA CCA GAG AGA GGA GGC CGC AAG CCA GCT CGC CTT ATC GTG TTC 3072 
Val Gin Pro Glu Arg Gly Gly Arg Lys Pro Ala Arg Leu lie Val Phe 
1010 1015 1020 

CCA GAC TTG GGG GTC CGT GTG TGC GAG AAA ATG GCC CTC TAT GAC GTG 3120 
Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val 
1025 1030 1035 1040 



V 



GTC TCC ACC CTC CCT CAG GC& 1TG ATG GGC TCC TCG TAC Gj^ 'TTC CAG 3168 
Val Ser Thr Leu Pro Gin y al Met Gly Ser Ser Tyr dp Phe Gin 

2045 ' 1050 1055 

TAT TCT CCT GGA CAG CGG GTC GAG TTC CTG GTG AAC GCC TGG AAA TCA 3216 
Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala Trp Lys Ser 
1060 1065 1070 

AAG AAG ACC CCT ATG GGC TTT GCA TAT GAC ACC CGC TGT TTT GAC TCA 3264 
Lys Lys Thr Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys Phe Asp Ser 
1075 1080 1085 

ACA GTC ACT GAG AAT GAC ATC CGT GTA GAG GAG TCA ATT TAT CAA TGT 3312 
Thr Val Thr Glu Asn Asp lie Arg Val Glu Glu Ser lie Tyr Gin Cys 
1090 1095 1100 

TGT GAC TTG GCC CCC GAA GCC AGA CAG GCC ATA AGG TCG CTC ACA GAG 3360 
Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala lie Arg Ser Leu Thr Glu 
1105 1110 1115 1120 

CGG CTT TAT ATC GGG GGT CCC CTG ACT AAT TCA AAA GGG CAG AAC TGC 3408 
Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly Gin Asn Cys 
1125 1130 1135 

GGC TAT CGC CGG TGC CGC GCG AGC GGC GTG CTG ACG ACT AGC TGC GGT 3456 
Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly 
1140 1145 1150 

AAT ACC CTC ACA TGT TAC TTG AAG GCC TCT GCA GCC TGT CGA GCT GCA 3504 
Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser. Ala Ala Cys Arg Ala Ala 
1155 1160 1165 

AAG CTC CAG GAC TGC ACG ATG CTC GTG TGC GGA GAC GGC CTT GTC GTT 3552 
Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val 
1170 1175 1180 

ATC TGT GAG AGC GCG GGA ACC CAG GAG GAC GCG GCG AGC CTA CGA GTC 3600 
He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu Arg Val 
1185 1190 1195 1200 

TTC ACG GAG GCT ATG ACT AGG TAC TCT GCC CCC CCC GGG GAC CCG CCC 3648 
Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro 
1205 1210 1215 

CAA CCA GAA TAC GAC CTG GAG TTG ATA ACA TCA TGC TCC TCC AAT GTG 3696 
Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val 
1220 1225 1230 

TCG GTC GCG CAC GAT GCA TCT GGC AAA AGG GTA TAC TAC CTC ACC CGT 3744 
Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg 
1235 1240 1245 



GAC CC6 
Asp Pro 
1250 




3750 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: baculovirus AcNPV 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo d24 

(ix) FEATURE: 

(B) LOCATION: from 1 to 23 bases homologous to portion of 
AcNPV polyhedrin gene downstream of the BamHl cloning 
site in pAc360 and similar vectors 

(D) OTHER INFORMATION: primes DNA synthesis from baculovirus 
transfer vector sequences which flank DNA inserted at the 
BamHl site. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CGGGTTTAAC ATTACGGATT TCC 23 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: baculovirus AcNPV 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo dl26 



. (ix) FEATURE: At 

(B) LOCATION* fronWto 31 bases homologous to^he upstream 
junction sequences produced when cDNA amplified by d75 (SEQ ID NO: 5) is 
cloned into the BamHl cloning site in pAc360 and similar vectors;* 
mismatches at bases 13 and 14 introduce a Pstl site from 1 to 10 'bases 
(D) OTHER INFORMATION: primes DNA synthesis at the junction of' 
baculovirus transfer vector sequences and sequences 
previously amplified by oligo d75; introduces a Pstl 
recognition site for subsequent cloning work 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

TAAGGATCCC CCT GCA GTA TCG GCG GAA TTC 31 
Ser Ala Val Ser Ala Glu Phe 

5 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 bases 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: synthetic DNA 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: N/A 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide synthesizer; oligo dl32 

( ix ) FEATURE : 

(B) LOCATION: form 5 to 10 bases Pstl recognition site 
from 13 to 27 bases linker coding for five Lys residues 
from 28 to 45 bases homologous to bases 4 to 21 of BR11 
(SEQ ID NO: 7) 

(D) OTHER INFORMATION: primes DNA synthesis at the 5' end of 
BR11 and introduces a synthetic sequence which codes for 
five lysines as well as a Pstl recognition site for 
subsequent cloning work 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CTGCCTGCA GTA AAG AAG AAG AAG AAG AAA ACC AAA CGT AAC ACC A 46 
Val Lys Lys Lys Lys Lys Lys Thr Lys Arg Asn Leu 

5 10 
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